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Background of the Invention 

1 . Field of the Invention 

[0001] The present invention relates managing consumption of 

utilities, such as electricity, natural gas and water; and more 
particularly to detecting the occurrence of abnormal usage. 

2 . Description of the \Related Art 

[0002] Large buildinqs often incorporate computerized control 

systems which manage the\operat ion of different subsystems, such 
that for heating, ventilation and air conditioning. In addition 
to ensuring that the subsystem performs as desired, the control 
system operates the associated equipment in as efficiently as 
possible. \ 

[0003] A large entity may nave numerous buildings under 
common management, such as on h university campus or a chain of 
store located in different cities. To accomplish this, the 



controllers in each building gather data Regarding performance 



of the building subsystems which data c4n be analyzed at the 



central monitoring location. / 

[0004] With the cost of energy increasing, building owners 



are looking for ways to conserve utility consumption. In 
addition, the cost of electricity for large consumers may be 



based on the peak use during a/billing period. Thus high 
consumption of electricity during a single day can affect the 

p rate at which the service Ls billed during an entire month. 

y3 In addition, certain preferential rate plans require a customer 

I s' / 

O to reduce consumption upon the request of the utility company, 

w / 

Nl such as on days of large service demand throughout the entire 

L utility distribution system. Failure to comply with the request 
Q / 

~g usually results in syciff monetary penalties which raises the 

Si energy cost significantly above that for an unrestricted rate 

M / 

plan. Therefore, /a consumer has to analyze the energy usage in 

order to determine the best rate plan and implement processes to 

ensure that operation of the facility does not inappropriately 



cause an increase in utility costs. 

[0005] In addition, abnormal energy or other utility 



consumption may indicate malfunctioning equipment or other 
problems in the building. Therefore, monitoring utility usage 
and detecting abnormal consumption levels can indicate when 
maintenance or replacement of the machinery is required. 
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[0006] As a consequence, sensors are being incorporated into 

building management systems to measure utility /sage for the 
entire building, as well as specific subsystems such as heating 
air conditioning and ventilation equipment . /These management 
systems collect and store massive quantities of utility use data 
which is overwhelming to the facility operator when attempting 
to analyze that data in an effort to direct anomalies . 

[0007] Alarm and warning systems and data visualization 

programs often are provided to assisst in deriving meaning 
information from the gathered dat^ However, human operators 
must select the thresholds for alarms and warnings, which is a 
daunting task . If the thresholds are too tight, then numerous 
of false alarms are issued, anra if the thresholds are too loose , 
equipment or system failures/can go undetected. The data 
visualization programs can help building operators detect and 
diagnose problems, but a yarge amount time can be spent 
detecting problems . Alsa, the expertise of building operators 
varies greatly. New or/ inexperienced operators may have 
difficulty detecting raults and the performance of an operator 
may vary with the time of day or day of the week. 

[0008] Therefore/ there is a need for robust data analysis 
methods to automatically determine if the current energy use is 
significantly different than previous energy patterns and if so, 
alert the build/ng operator or mechanics to investigate and 
correct the problem. 



Summary of the Invention 

[0009] Abnormal utility usage by a/building or a particular 

apparatus in the building can be determined by repeatedly- 
measuring the level of use of the/utility thereby producing a 
plurality of utility measurements . A Generalized Extreme 
Studentized Deviate (GESD) statistical procedure is applied to 
the plurality of utility measurements to identify any 
measurement outliers. Thef measurement outliers denote times 
^ when unusual utility coi/sumption occurred, thereby indicating 
/3 times during which operation of the building or the particular 
apparatus should b^/ investigated. 

Q 

Cj [0010] In the preferred embodiment, a severity of abnormal 

M, 

1 utility usage can be established by determining a degree to 

H which the associated outlier deviates from the norm. This can 

V 3 

!M be accomplished by calculating robust estimates of the mean 
P 

J 558 * ( x robust) and the standard deviation ( s robust ) of each outlier 

Brief Description of the Drawings 

[0011] FIGURE 1 is a block diagram of a distributed facility 

management system which incorporates the present invention; 
[0012] FIGURE 2 is a box plot of average electrical power 

consumption for a building; 

[0013] FIGURE 3 is a graph depicting the energy consumption 

for a building; and 
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[0014] FIGURE 4 is a flowchart of the algorithm that analyzes 

the energy consumption data for the building. 



Detailed Description of the Invention 

[0015] With reference to Figure 1, a distributed facility 

management system 10 supervises the operation of systems in a 
plurality of buildings 12, 13 and 14. Each building contains 
its own building management system 16 which is a computer that 
governs the operation of various subsystems within the building. 
Each building management system 16 also is connected to numerous 
sensors throughout the building that monitor consumption of 
different utility services at various points of interest. For 
example, the building management system 16 in building 13 is 
connected to a main electric meter 17, the central gas meter 18 
and the main water meter 19. In addition, individual meters for 
electricity, gas, water and other utilities can be attached at 
the supply connection to specific pieces of equipment to measure 
their consumption. For example, water drawn into a cooling 
tower of an air conditioning system may be monitored, as well as 
the electric consumption of the pumps for that unit. 

[0016] Periodically the building management system 16 

gathers data from the sensors arya stores that information in a 
database contained within the .memory of the computer for the 
building management system. / The frequency at which the data is 
gathered is determined by/ the operator of the building based on 
the type of the data and the associated building function. The 
utility consumption for functions with relatively steady state 



operation can be sampled leaf's frequently, as compared to 
equipment large variation*/ in utility consumption. 




[0017] 



The gathered data can analyzed either locally by the 



building management system 16 or forwarded via a communication 
link 20 for analysis by a centralized computer 22. For example, 
the communication link 20 can be a wide area computer network 
extending among buildings in an office park or a university 
campus, or the communication link may comprise telephone lines 
extending between individual stores and the principal office of 
a large retailer. 

[0018] The present invention relates to a process by which 

the data acquired from a given building is analyzed to determine 
abnormal usages of a particular utility service. This is 
accomplished by reviewing the data for a given utility service 
to detect outliers, data samples that vary significantly from 
the majority of the data. The data related to that service is 
separated from all the data gathered by the associated building 
management system. That relevant data then is categorized based 
on the time periods during which the data was gathered. Utility 
consumption can vary widely from one day of the week to another. 
For example, a typical office building has relatively high 
utility consumption Monday through Friday when most workers are 
present, and significantly lower consumption on weekends. In 
contrast, a manufacturing facility that operates seven days a 
week may have similar utility consumption every day. However, 
different manufacturing operations may be scheduled on different 
days of the week, thereby varying the level of utility 
consumption on a daily basis. 



[0019] Therefore, prior to implementing the outlier analysis, 
the building operator defines one or more groups of days having 
similar utility consumption. That grouping can be based on a 
knowledge of the building use, or from data regarding daily 
average or peak utility consumption. For example, Figure 2 is a 
box plot of the average daily electrical power consumption for 
an exemplary building. A similar box plot can be generated for 
the peak electrical power consumption. It is apparent from an 
examination of this graph that consumption during weekdays 

(Monday through Friday) is similar, i.e. the normal consumption 
of electricity falls within one range of levels (A) , and weekend 
periods (Saturday and Sunday) also have similar consumption 
levels that fall within a second range (B) . Therefore, separate 
utility consumption analyses would be performed on data from two 
groups of days, weekdays and weekends. However, different day 
groups would apply to a manufacturing plant in which high 
utility consuming equipment is run only on Tuesdays, Thursdays 
and Saturdays. In this latter example, Tuesdays, Thursdays and 
Saturdays would be placed into one analysis group with the 
remaining days of the week into a second group. 

[0020] Figure 3 depicts the peak daily consumption for this 

building over a period of four weeWs. The weekday peaks are 
significantly greater than the p^ak consumption on the weekends. 
Point 3 0 represents a day when/peak consumption of electricity 
was abnormally high. This may have been caused by a large piece 
of equipment turning on unexpectedly, for example an additional 
chiller of an air conditioning system activating on a single 
very hot day. The da*:a value for this abnormally high level 



is referred to as an ""outlier y^and building operators are 
interested in finding such ouxliers and learning their cause. 
Outliers often result from/equipment of system control 
malfunctions which requi32*e correction. 

[0021] The daily usage pattern for each type of utility 

service can be different. For example, the electricity use in 
a manufacturing facility may be relatively uniform every day 
of the week, but a special gas furnace is operated only on 
certain days of the week. The grouping of days for analyzing 
electricity use in this facility will be different than the day 
groups for gas consumption. As a consequence, each utility 
being monitored is configured and analyzed independently. 

[0022] Focusing on one type of utility service, such as 

electricity use for the entire building, acquisition of periodic 
electric power measurements from the main electric meter 17 
produces a set X of n data samples where X e {x^ ,x 2 ,x 3 ,...,x n } . The 
analysis will find the elements in set X that are outliers, 
i.e., statistically significantly different than most of the 
data samples. This determination uses a form of the Generalized 
Extreme Studentized Deviate (GESD) statistical procedure 
described by B. Rosner, in "Percentage Points for a Generalized 
ESD Many-Outlier Procedure" Technometrics , Vol. 25, No. 2, pp. 
165-172, May 1983. 

[0023] Prior to the analysis the user needs to specify the 

probability a of incorrectljf declaring one or more outliers when 
no outliers exist and an /upper bound (n u ) on the number of 
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potential outliers- The probability a defines the sensitivity 
of the process and is redefined periodically based/on the number 
of false warnings that are produced by the systerar finding 
outliers. In other words the probability is adjusted so that 
the number of outliers found results in an acceptable level of 
warnings of abnormal utility consumption witmin the given 
reporting period, recognizing that false warnings can not be 
eliminated entirely and still have an effective evaluation 
technique- The upper bound (n u ) specifies a maximum number of 

data samples in set X that can be considered to be outliers. 
This number must be less that fift# percent of the total number 
of data samples, since by definition a majority the data samples 
can not be outliers, i.e., n u / 0.5(n-l). For example, a upper 

bound (n u ) of thirty percent/can be employed for electricity 

consumption analysis . 
[0024] The data analysis commences at step 40 by setting the 
initial value n out for mimber of outliers to zero. Then at step 

42 a FOR loop is defined in which the program execution loops 
through steps 44-58 /processing each data sample specified by 
the upper bound n u ,f i.e. samples x if where i = 1, 2, 3, n u . The 
arithmetic mean (?) of all the elements in set X is calculated 
at the first step 44 of this loop. Then at step 46, the 
standard deviation (s) of the elements in set X is calculated. 
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[0025] If the standard deviation is not greater than zero (s 

> 0) , i.e. the samples of utility usage are substantially the 
same as may occur in rare cases, then the pass through the loop 
terminates at step 48 by returning to step 42. Otherwise the 

execution of the algorithm advances to step 50 at which the 
extreme member in set X is located. That extreme element x ei is 
the element in set X that is farthest from the mean x . Using 
that extreme element x ei the computer 22 calculates the i th 
vQ extreme studentized deviate R± at step 52 according to the 

M expression: 
S3 

ly 

M I - 1 

H- R t =i-^ !■ (l) 



~f The / th 100a percent critical value X± then is calculated at step 

Q 

fs£ 54 using the equation: 



, _ (""'X-H; (2) 

fa -i-l + ^_ / _ ls/> ) 



here t n _ t ^ p is the student's t-distribution with (n — i — l) degrees 
of freedom, and a percentile p is determined from: 



w 



P = i- 



< a ^ 
2(»-/ + lX 



(3) 
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[0026] Abramowitz and Stegun, Handbook of Mathematical 
Functions with Formulas , Graphs, and Mathematical Tables, Dover 
Publications, Inc., New York, 1970, provides an process for 
determining the student's t-distribution t vp , for the p th 

percentile of a t-distribution with v degrees of freedom. This 
determination begins by estimating the standardized normal 



deviate f at the p th percentile, according to: 



f = In 



(4: 



y 

Vf 

"v. 

Lj. 



z pS f- 



2.515517 + 0.802853f + 0.010328f 2 
[l + 1.432788f + 0.189269f 2 + 0.001308f- 



(5: 



M 

sy 
Q 



[0027] The student 1 s t-distribution t vp is estimated from z p 

and the degrees of freedom v using the following expressions: 



S3 



84 



96 

= ^ 3z p +19z p +17z p- 15z p) 

= 92T6o' 79 ^ +776z7p +14S2z p " 1920 4 ~ 945z , 



(6) 
(7) 
(8) 
(9) 



/ ~ _ , Si . Si , S3 , Sa 
K, P = Z P + — + — + — + — 

V V V V 



(10) 
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[0028] Upon solving equations (1) and (2) , if at step 56 the 
z th extreme studentized deviate R± is greater than the / th 100a 
percent critical value X± ( > X t ) , then the z th extreme data 

sample x e ,i is an outlier and the number of outliers equals L 

[0029] At step 58, the extreme element x e i removed from 

set X and the number of elements in that set now equals n-i. 
The algorithm then returns to step 42 to repeat the process and 
hunt for another outlier. Eventually the set of data samples 
becomes reduced to the upper bound (n u ) at/ which point the FOR 

loop terminates by branching to step 60/ At that point, the 
outliers have been identified with a set of outliers given by 

X„„, e {x, ...x „ }. If no outliers y/ere found in set X, then 

out <- e,l ? c,2 7 7 e -> n out I 

X out is an empty set . 

[0030] After the outliers have' been identified a robust 
estimate of the mean (x robusi ) andr a standard deviation (s robust ) for 

the set of n data samples X e jpc l9 x 29 x 39 ... 9 x tt } are calculated at steps 

64 and 66. In essence this /determines how far the outliers 
deviate from the remainder/of the data and thus represents the 
severity of the abnormal Xitility consumption denoted by each 
outlier. The process far making this determination commences 

with the set of outliers X out and the set ( X non _ out ) of the data 
samples from set X that are not outliers. Specifically: 

X non-^c{^k eXand ^^ X out } dD 
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[0031] The robust estimate of the mean (x robust ) is the average 

value of the elements in set X non _ out as given by: 



(12) 



robust 



n-n 



out 



where Xj e X 



non-out * 



[0032] 



The robust estimate of the standard deviation (s robust ) 



is the sample standard deviation of the elements in set X 
as defined by the expression: 



standard deviation (s robust ) quantify the severity of the abnormal 
utility usage represented by the corresponding outlier. These 
values can be plotted to provide a graphical indication as to 
that severity by which the building operator is able to 
determine whether investigation of the cause is warranted. 
[0034] For days with abnormal energy consumption, the robust 

estimates of the mean (x robust ) and the standard deviation (s robust ) 
are used to determine how different the energy use is from the 
typical day. One measure is a robust estimate of the number of 
standard deviations from the average value: 




(13) 



[0033] 



The robust estimates of the mean ( x robust ) and the 



P 

?B 3 

Q 

Q 



Zj = X eJ X robust (14) 



where jc c j is the energy consumption for the j outlier, x robust is 
a robust estimate of the average energy consumption for days of 
the same day type as outlier j, and s robust is a robust estimate of 

the standard deviation of energy consumption for days of the 
same day type . 

[0035] The operator can be presented with tables or graphs 

O that show the outliers and the amount of variation for the 
y3 outliers. 
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